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We have created heralded coherent state superpositions (CSS), by subtracting up to three photons 
from a pulse of squeezed vacuum light. To produce such CSSs at a sufficient rate, we used our high- 
efficiency photon-number-resolving transition edge sensor to detect the subtracted photons. This is 
the first experiment enabled by and utilizing the full photon-number-resolving capabilities of this 
detector. The CSS produced by three-photon subtraction had a mean photon number of 2.75^0.24 
and a fidelity of 0.59io;i4 with an ideal CSS. This confirms that subtracting more photons results in 
higher-amplitude CSSs. 
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A coherent state of the electromagnetic field is often 
considered the most classical-like pure state, but a su- 
perposition of two coherent states with opposite phases 
has interesting quantum features. For example, coher- 
ent state superpositions (CSS) can be exploited for per- 
forming quantum information tasks and high precision 
measurements. CSSs are also of fundamental inter- 
est: When they contain many photons they are super- 
positions of macroscopically distinguishable states often 
called "Schrodinger cat states" . Schrodinger's Gedanken 
experiment of 1935 described a cat apparently held in a 
superposition of alive and dead states [I], but many re- 
searchers now use "Schrodinger cat" to refer to a quan- 
tum state that is a superposition of two highly distin- 
guishable classical states such as a CSS of high amplitude 
or mean number of photons [2] . CSSs have been prepared 
in traveling optical modes with a mean of up to 2.0 op- 
tical photons by heralding [3HZ]- With sufficiently high 
quality and well characterized CSSs, one can in principle 
quantum compute using simple linear optical components 
and homodyne measurements [8 . Less ambitiously, they 
can serve as fiying qubits for quantum communication. 
In addition to potentially simple processing, advantages 
of CSSs in traveling optical modes include fast linear ma- 
nipulations, transport over large distances, robustness if 
loss is controlled, and simple conversion to entangled op- 
tical states, all at room temperature. 

The CSSs that we discuss here are superpositions of 
two coherent states | ±q;) of a single mode of light, where 
+a and —a are the states' complex mode amplitudes. 
Our experiments aim to prepare two special instances 
of these CSSs: the odd and even CSSs defined as the 
superpositions |— Q;)±|a) (unnormalized) . These are dis- 
tinguished by having only even (-I-) or odd (— ) numbers 
of photons. For \a\ ^ 1, the states' mean number of 
photons, (n), is approximately |ap. Two quality mea- 



sures for experimental CSSs are the fidelity of the created 
state with the nearest ideal CSS and the magnitude of 
the amplitude of this ideal CSS. There are two reasons 
to aim for large amplitude CSSs. The first is that to 
be useful for superresolution metrology, the probability 
Pq = 1 — exp(— 2|q;P) with which the superposed coher- 
ent states can be distinguished must be close to one. To 
achieve po > 0.99 requires |q;| > 1.52. The second is that 
a minimum size estimated as \a\ > 1.2 is required for 
fault tolerant quantum computing |^. Because opera- 
tion close to the lower bound is unrealistic due to exces- 
sive resource requirements, we are motivated to produce 
bigger CSSs. Similarly, high fidelity is required to avoid 
excessive overheads for eliminating unwanted errors due 
to deviations from an ideal CSS. The highest fidelity CSS 
achieved so far has |a| = 1.1 and a fidelity F = 0.76 [7], 
while the largest has an effective size of |a| = 1.4 and 
fidelity F = 0.60 We have created CSSs with ampli- 
tudes and fidelities' of \a\ = 1.76lai'g and F = 0.59l:o;;t 
and |a| = 1.32i;j:!!^ and F ^ 0.522l!l:lli't Unlike the exper- 
iment reported in [7], our CSSs are generated in pulsed 
rather than continuous-wave mode. Pulsed operation is 
required for many applications to avoid the effects of light 
in neighboring modes in subsequent manipulations and 
measurements of the states. 

To create the CSSs, we used the photon subtraction 
scheme depicted in Fig. [T] A squeezed vacuum state is 
prepared and sent through a weakly refiecting beam split- 
ter. Reflected photons that are detected herald an ap- 
proximate CSS in the transmitted beam. Because higher 
amplitude and fidelity CSSs can be created by herald- 
ing on detecting multiple photons at once [101 E]; we 
used a photon-number-resolving transition edge sensor 
(TES) [IS [13] for subtracting two or three photons. The 
TES used in our experiment has an efficiency of 85 ± 2 % 
and can resolve up to 10 photons. This enabled obtain- 
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FIG. 1. (color online) Scheme for optical coherent state su- 
perposition (CSS) creation. An upconverted laser pulse enters 
an optical parametric amplifier (OPA) to create a squeezed 
vacuum state in section (A). After spectral filtering, this state 
is sent to a weakly reflecting beamsplitter R in (B). Refiected 
photons that are detected herald a CSS emerging from R into 
(C). Its quadratures are measured by homodyne detection in 
(C). 

ing higher amplitude CSS at practical rates. We also 
subtracted one and two photons using avalanche photo- 
diodes (APDs) for comparison. 

For the experiments, we used a cavity-dumped 
861.8 nm laser with transform-limited pulses of 140 fs 
(typical), pulse energies of 40 nJ and a repetition fre- 
quency of 548 kHz. A fraction of each pulse with > 10® 
photons was used as the local oscillator (LO) in the ho- 
modyne detector. The rest pumped a temperature-tuned 
150 fim KNbOa crystal to generate a second- harmonic 
pump pulse (efficiency 25 %) for the optical parametric 
amplifier (OPA) shown in Fig. [l] The OPA consists of 
a temperature-tuned 200 /xm long KNbOs crystal. We 
determined that the squeezed vacuum state generated 
can be modeled as a pure squeezed state with minimum 
quadrature variance Vq = —6.8 dB subjected to a loss of 
7s = 0.36. We define the squeezing purity as 77s — 1 — 7s. 
We used a variable beam splitter (R in Fig. [T]) made 
with a half-wave-plate and a polarizing beamsplitter and 
configured to send from 2.5 % (one-photon subtraction) 
to 20 % (three-photon subtraction) of the light to the 
photon subtraction arm. Photons in this arm were spec- 
trally filtered by a fiber Bragg grating with a bandwidth 
of 1.5 nm in a polarization-based circulator before being 
coupled to the photon detector/counter. The other arm 
of the variable beam splitter delivers the heralded CSS 
to a conventional homodyne detector for measuring the 
quadrature at the phase of the LO. The CSS temporal 
shape is significantly different from that of the original 
pump due to the large mismatch in group velocity in 
our KNbOa crystals. To compensate, we expanded the 
temporal width of the LO by up to a factor of 2 with 
a pulse-shaping setup [2]. The phase of the LO was 
adjusted by a piezo-mounted mirror displaced at a fre- 
quency of 2.75 Hz with a saw-tooth profile to obtain a 



complete phase space measurement of the CSS. Further 
technical details are in [15] . 

We reconstructed the states produced by photon sub- 
traction immediately after the subtracting beam split- 
ter by maximum likelihood quantum state estimation as 
discussed in Ref. [TB]. For this purpose, we considered 
the homodyne measurement setup including all of its 
losses such as those associated with the initial beamsplit- 
ter and imperfect spatial mode matching to the LO as a 
monolithic lossy quadrature measurement. This requires 
knowing the loss 7/1, which we experimentally determined 
to be 7/1 = 15 ± 2 %. The uncertainty in propa- 
gates to an uncertainty in the reported CSS parameters. 
In particular, the fidelities differ by up to ±0.02 if the 
boundary values for 7^ are used. However, the main un- 
certainty in our state reconstructions is due to finite sam- 
ple statistics. We estimated this statistical uncertainty 
by parametric-bootstrap resampling |17j . We report in- 
ferred values such as fidelities in the form F^^J^j^^ , where 
F is the fidelity of the maximum likelihood estimate from 
the experiment's data, U is the 84**^ percentile of the fi- 
delities of the states estimated from resampled data sets, 
and L is the 16**^ percentile. We obtained 100 resampled 
data sets for one- and two-photon subtraction and 1000 
for three-photon subtraction. There is a significant bias 
toward more mixed states in the resampling procedure 
and the amount of bias increases with the purity of the 
state from which samples are generated. We did not cor- 
rect for this bias in our reconstruction of the states, but 
note that it suggests that the true fidelities are above the 
reported ones. 

The reconstructed states have well-defined average 
photon numbers, (n). The reported amplitudes are those 
of the nearest even or odd CSS, which is found by max- 
imizing the fidelity with respect to the reconstructed 
state. The reported fidelities are these maximized ones. 
Table H] summarizes our results. 

Fig. [2] shows the reconstructed Wigner function for a 
one-photon-subtracted state heralded by an APD. The 
quantum character of this state can be identified by its 
negativity near the origin of the Wigner function, whose 
minimum has a value of Wmin = — 0.0411!!:ooi. The state's 
fidefity is F = 0.522t!!;o;o with respect to an odd CSS with 
\a\ — 1.32tom- This fidelity is higher than the maximum 
fidelity of = 0.487 that any coherent state can have 
with the \a\ = 1.32 odd CSS. (Note that this is also the 
highest fidelity that any mixture of coherent states can 
have. The maximum fidelity of a coherent state with a 
CSS depends on the CSS's |a| and whether the CSS is 
even or odd. As \a\ increases, this fidelity approaches 
0.5 from above for even CSSs but from below for odd 
CSSs.) The amplitude of the CSS is notably larger than 
the |a| = 0.88, = 0.70,11^^111 = -0.13 state described 
in Ref. [5| . The lower fidelity is primarily due to a lower 
squeezing purity 77^ in our experiment. 

We obtained even CSSs by two photon subtraction. 
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FIG. 2. (color online) Maximum likelihood estimate of an odd 
CSS generated by one-photon subtraction from a squeezed 
vacuum. The graph shows the unitless Wigner function value 
W{q,p) as a function of the unitless quadratures of the elec- 
tromagnetic field. 



TABLE L Results for the one-, two- and three-photon sub- 
traction experiments. W^min and{n) are the minimum value 
and the mean photon number of the reconstructed state, re- 
spectively. F is the fidelity of the reconstructed state com- 
pared to a theoretical CSS with amplitude \a\ 
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We performed two experiments, the first used a TES, 
the second used two APDs at the two outputs of a 50/50 
beamsphtter. For the APDs, coincidence heralds the 
presence of two photons in the subtraction arm. The 
reconstructed states are shown in Fig. [3j The TES mea- 
surement yielded a smaller CSS (|aTEs| = LlOla!") than 
the APD measurement (laAPol = 1.30±o;o2). The fideli- 
ties are Ftes = 0.5311^11 and Fapd = 0.523t'^,°,H For 
comparison, the maximum fidelity of coherent states with 
an \a\ = 1.16 (|a| = 1.30) even CSS is 0.552 (0.522). Ear- 
lier studies |7| showed the continuous wave generation of 
even CSSs with \a\ = 1.41 and F = 0.60. 

The fidehty of the heralded CSSs is aff'ected not only 




FIG. 3. (color online) Wigner functions of the maximum like- 
lihood estimates of even CSS created by two-photon subtrac- 
tion and heralded with (a) one transition edge sensor and (b) 
two multiplexed APDs. 



by low squeezing purity, but also by unwanted photons 
not matching the LO mode but still visible to the detec- 
tors. In addition to stray light (which can in principle be 
controlled) such photons come from temporally similar 
modes that are also squeezed in the OPA. When squeezed 
light is produced by down-conversion of a pulsed pump 
laser, multiple spatial-temporal modes may be squeezed, 
and none of these modes is guaranteed to match the mode 
of the LO [18^ . These other modes have similar spectra to 
the LO mode and therefore cannot be conventionally fil- 
tered. Detections due to photons in these modes degrade 
the fidelity of the CSSs. We quantify the effect of un- 
wanted photons with the "modal purity" ^„ of n photon 
subtraction - the probability that, when the subtraction 
detector registers n photons, these n photons were from 
the mode matching the LO. To estimate the modal puri- 
ties, we used a single-mode photon subtraction model to 
fit our data 15J. From this we determined ^2,tes — 0.62 
and ^2. APD = 0.85, compared to = 0.91 for the one- 
photon subtraction experiment. The reason for the lower 
modal purity of the TES experiment is its greater sen- 
sitivity to stray photons from the LO. With the APDs, 
we can gate the detections to reject slightly delayed LO 
photons arising from downstream reflections. The TES 
is slower, so such gating is not possible. 

The main advantages of the TES are the greater ef- 
ficiency and the ability to directly count photons. In 
the two-photon subtraction experiments, this higher effi- 
ciency resulted in improving the rate at which CSSs were 
heralded by a factor of three. 

Three-photon subtraction events are extremely rare in 
our experiment. Nevertheless, using the TES we were 
able to detect 1087 three photon events over a period of 
approximately 60 hours. With three multiplexed APDs 
we would have collected only about 120 events. Fig. |4] 
shows the odd CSS. To increase the three photon event 
rate, we increased the reflectivity of the photon subtrac- 
tion beam splitter to 20 %, sacrificing the fidelity of 
the CSS. The reconstructed state shows a negative min- 
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Ideal CSS |a| = 1 .76 




FIG. 4. (color online) Maximum likelihood estimate of 
an odd CSS after the subtraction of three photons from a 
squeezed vacuum. The reconstructed state has a fidelity of 
F = 0.592;°o.f42 with a CSS of amplitude \a\ = 1.76^^9. In- 
set: Wigner function of an ideal odd CSS with |q| — 1.76 

imum of its Wigner function M^min — — O.liel'aol'g and a 
mean photon number of 2.75to'il. The state has fidehty 
F = 0.59l!!:;i with an ideal CSS of |a| = 1.76t!!;?l The 
estimated modal purity in this experiment is ^3 = 0.84. 
Thus, we observed the predicted increase in CSS ampli- 
tude for three-photon subtraction, but the increase in 
fidelity is not statistically significant. 

In conclusion, we have measured heralded optical 
CSSs created by subtracting up to three photons from 
a squeezed vacuum state, using APDs for one- and 
two-photon subtraction and a TES for two and three. It 
was only by taking advantage of the high efficiency and 
the direct photon counting capability of the TES that 
we were able to successfully subtract three photons with 
a sufficiently high rate of CSS production. The CSSs 
produced were analyzed by homodyne measurement and 
maximum-likehhood state estimation. The quality of 
the CSSs can be improved by reducing the losses expe- 
rienced by the squeezed vacuum state before reaching 
the photon-subtraction beam spfitter. For multi-photon 
subtraction, however it is crucial to reduce the presence 
of unfilterable photons in unwanted modes. A promising 
route that addresses both problems is to tailor the 
squeezing source to create squeezed light only in a single 
mode matched to the LO. This route is being pursued in 
the photon-pair generation community [I9l - |2T| . Based 
on our findings, we propose that the combination of 
pure vacuum squeezing and high efficiency detectors 
with photon-number-resolving capabilities can yield 
high rate, amplitude and fidelity CSSs to support 



quantum information processing and metrology beyond 
the quantum limit. 

Added note: Recently, the authors became aware of a 
similar measurement that made use of photon-number- 
resolving transition edge sensors |22) . 

This work was supported by the NIST Innovations in 
Measurement Science Program. T.G. thanks P. Grangicr 
and A. Ourjoumtsev for discussions. This is a contribu- 
tion of NIST, an agency of the U.S. government, not 
subject to copyright. 
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This supplementary material gives more technical experimental details and deeper 
insight into the analysis than was given in the main paper. First, we give a detailed 
description of the experimental setup and our efforts to improve the fidelity of our 
squeezed vacuum and coherent state superpositions (CSSs). Then we present an 
analytical model of subtraction of up to three photons from a squeezed vacuum. Last we 
give a detdled analysis of experimental parameters and results. 

I. Experimental Setup 

Figure Al shows our experimental setup. We use a cavity-dumped femtosecond 
laser with transform-limited pulses of t5^ically 140 fs duration and a repetition frequency 
of 548 kHz. The center wavelength of the cavity dumper output is Xo = 861.8 nm. Typical 
pulse energies are 40 nJ at the output port of the cavity dumper. We spatially filter the 
laser beam by sending it through a 30 |a,m diameter pinhole (PH). At the 90/10 beam 
splitter (BS) we split the laser beam into two parts. The weaker part of the beam is the 
strong local oscillator (LO) for the homodyne detection (>10^ photons/pulse). The 
stronger part pumps a 150 |im thick KNbOs crystal (SHG) to generate the second- 
harmonic pump photons with a conversion efficiency of 25%. To eliminate any 
fundamental photons from the laser itself, we spectrally filter (SF) the second-harmonic 
pump. Then, the pump is focused into a 200 |Xm thick down-converting KNbOa crystal 
(OPA). Both crystals are temperature-tuned with stability better than 0.05°C for optimum 
phasematching. We optimized both crystal temperatures to achieve purest squeezing 
(TopA = 28.3°C; Tshg = 27.5°C). During the course of one measurement we observe a 
decrease in the squeezing from the OPA, which is probably caused by a photorefractive 
effect present in our crystals [1]. This decrease typically happens within the first hour of 
the measurement and remains stable afterwards if we keep the pump focused at the same 
position. Our optics in the homodyne detection arm after the down-converter eliminate 
most (>99.9%) of the pump, and its contribution to the homodyne signal is negligible. 
Therefore, no further spectral filtering is required after the OPA. 
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FIG. Al. Experimental Setup. The experiment setup consists of six main sections, labeled A-F. A: second- 
harmonic conversion of fundamental beam; B: squeezed vacuum generation; C: photon subtraction; D: 
spectral filtering of subtracted photons and detection thereof; E: temporal pulse-shaping of local oscillator; 
F: homodyne detection. The inset (a) shows the polka dot filter used in the pulse-shaping setup. Inset (b) 
shows the charge-integration circuit for homodyne detection, and inset (c) shows quadrature data for a full 
scan of the phase of a CSS created by one photon subtraction. PH: pinhole; BS: beam splitter; SHG: 
second-harmonic generation; SF: spectral filter; OPA: down-conversion crystal; HWP: half-wave-plate; 
PBS: polarizing beam splitter; QWP: quarter-wave-plate; FBG: fiber Bragg grating; #n: photon-number- 
resolving detector; PD: photodiode; HDC: homodyne detection circuit; ^. Piezo stage; G: grating; L: lens; 
PDF: polka dot filter 



The squeezed vacuum is sent to our photon- subtraction beam- splitting components. The 
beam sphtter consists of a half-wave-plate and a polarizing beam splitter cube 
(HWPi/PBSi). This arrangement is helpful, as we can adjust the beam splitter's 
transmissivity, rather than having a fixed splitting ratio. The s-polarized waves 
experience less than 1% loss under reflection and the p-polarized waves undergo a 5% 
loss due to transmission through the polarizing beam splitter. The reflected photons are 
sent to the homodyne detection arm, and hence the loss in this arm is minimized. The 
subtracted photons are directed to our spectral filter setup. It consists of a fiber Bragg 
grating (FBG) and a circulator. The FBG has a bandwidth of AXfwmh =1-5 nm. The 
circulator is a combination of a free-space polarizing beam splitter and quarter-wave- 
plate (PBS2/QWP). This setup allows for good filtering of the subtracted photons. 
However, two fiber couplers are in the path of the subtracted photons. This decreases the 
overall detection efficiency of the subtracted photons due to the limited fiber coupling 
efficiency. The subtracted photons that are within the FBG bandwidth reach the photon- 
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number-resolving detector with a probability of about 20%. We used two different 
photon detectors: avalanche photo diodes (APDs) for the one and two photon subtraction, 
or one transition-edge sensor (TES) for the two and three photon subtraction experiments. 
Upon detection of at least one photon at the photon detector, we know that we have 
prepared an approximation of a CSS. Homodyne measurements are recorded regardless 
of whether a photon is subtracted or not. When no photons are subtracted, a noisy 
squeezed state is created, which we use to calibrate the phase of the LO. When a photon 
subtraction event occurs, the coincident homodyne measurement is tagged. 

The temporal width of the strong LO is controlled by a pulse-shaping setup [2] so 
that we can compensate for the large mismatch in group velocity in our KNbOa crystals, 
which is -1.2 ps/mm [3]. This group velocity mismatch causes the temporal width of the 
squeezing pulse to be about double that of the LO. Using two gratings (G), two lenses (L) 
and a spatial filter (PDF), we can tune the temporal width of the LO from 140 fs to about 
300 fs. The spatial filter is a polka dot filter (PDF) with Gaussian transmission profile to 
minimize possible chirp imposed by the pulse-shaping setup. We lithographically made 
these polka dot filters on chromium masks. In order to minimize and randomize the 
interference of the LO, which was sent through the PDF, we randomly distributed 4 |i.m x 
4 |im squares along the z-axis of the filter. The squares' density changed according to a 
Gaussian envelope transmission profile along the x-axis of the filter. The filter design is 
shown in inset (a) of figure Al. Different LO temporal widths are achieved by different 
PDF designs. The highest fidelity CSS results were obtained for an LO width of 230 fs. 
However, this is about 40 fs shorter than the width that results in the most pure squeezed 
states, where the squeezing is measured by direct homodyne detection. The phase of the 
LO is adjusted by a piezo-mounted mirror {(j)). We continuously displace the mirror with 
a frequency of 2.75 Hz. The mirror's displacement is a saw-tooth profile with amplitude 
of about IAq. This allows a complete phase space measurement of our CSS. The LO and 
prepared CSS are combined at PBS3. A half-wave-plate (HWP2) and a polarizing beam 
splitter (PBS4) constitute the 50/50 beam splitter necessary for the homodyne detection 
setup. Adjustment of HWP2 allows for very accurate balancing of the homodyne 
detection system. Both photodiodes are high-speed pin-Si photodiodes with high 
detection efficiency. Out of a set of 10 same- wafer photodiodes, we chose the two 
photodiodes that have the best matching temporal response. The electrical homodyne 
circuit is shown in inset (b) of figure Al. It consists of a charge integration stage and a 
lOx amplification stage. The charge integration includes one field effect transistor (FET) 
bridged across the integrating capacitor. The FET circuit induces a random offset charge. 
We correct for this charge by subtracting the voltage before from the voltage after each 
laser pulse. An intensity plot of a quadrature probability distribution for a one photon 
subtraction experiment is shown in inset (c). The quadrature data consist of 324,000 
heralded events. These data are then processed, and maximum likelihood estimation 
gives the density matrix of the measured state. 

The overall efficiency of the homodyne detection is /^h = 0.853 + 0.028. We 
estimate this by separately measuring four efficiencies rio, T]^, rj^, and T]^ and computing 
7h = 7o7d7w7e- The four efficiencies are defined as follows: 1) The efficiency of all 
optical elements after the photon subtraction beam splitter to the face of the homodjTie 
detector's photodiodes is rjo = 0.94 + 0.005. 2) The mean efficiency of the photodiodes is 
Tjd = 0.976 + 0.022, where the difference in efficiency between the two photodiodes is 
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less than 0.5%. 3) The efficiency of the mode matching between the squeezed mode and 
the LO is 7]^ = 0.95 + 0.005. Tj^ was determined by interfering a probe beam with the LO. 
However, the probe beam does not travel through the up-conversion crystal and its 
temporal width has not been altered. Therefore, for this measurement we did not alter the 
LO's tempord width. The LO is simply sent through the pulse shaper without a polka dot 
filter in place. (We are unable to measure the mismatch between the temporal width of 
the CSS and the LO, so it is not included in rjh.) The error bar for T]^ accounts only for 
the statistics of the measurement, not for likely systematic biases. The effect of such 
biases on the inferred states is well below the states' statistical error as discussed in the 
paper. 4) tje is the efficiency that is formally equivalent to the electronic background 
noise. The electrical background noise of the homodyne detectors and electronics is e = 
Ve/(Ve + Vv) = 0.021 + 0.001, where Vg is the variance of voltages measured when no light 
enters the photodiodes and Vg + K is the variance observed when only the local oscillator 
is present (Vg + K includes both electronic noise and shot noise of the LO). This is 
formally equivalent to an efficiency rje = (i-e) = 0.979 + 0.001 [4]. Our calibration of the 
overall efficiency of the homodyne detection, r/h, does not include any excess noise from 
the down-conversion process itself, nor does it include the reflectivity of the beam splitter 
used for photon subtraction. 

We model the squeezed vacuum generated in the experiment as a pure squeezed 
vacuum state with squeezed quadrature variance Vo, which has passed through a medium 
with transmissivity rj^. This state is then measured with the homodyne system whose 
efficiency is hence the measured efficiency is = (When measuring the 

squeezing directly, we set the photon subtraction beam splitter reflectivity to 0.) We can 
cdculate 7m and Vo by use of [5]: 

(i-^J=7ji-n), (Ai) 



and 



2-V-V • '^'> 

1 P 



where Vp is the observed squeezed quadrature's variance, and Vq is the observed anti- 
squeezed quadrature's variance. When we measured the squeezing by homodyne 
detection directly, we observed Vq = 3.129 (+5.0 dB) and Vp = 0.565 (-2.5 dB). After 
correcting for the overall homodyne detection efficiency rjh, we found = 0.64 and an 
inferred squeezing variance of Vo= 0.205 (-6.8 dB), based on equations Al and A2. 



II. Photon subtraction from squeezed vacuum (model calculations) 



In the following, we present an analytical model of the photon subtracted 
squeezed states. The model is similar to the model published by Ourjoumtsev et aL[l]. 
However, it considers up to three photons subtracted from the squeezed vacuum and does 
not assume that the efficiency of the photon subtraction detector is small. 
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FIG. A2. Schematic of the model for calculation of the two-mode Wigner function. A squeezed state is sent 
into a beam splitter (BSi), with transmissivity Tj^, to model the loss of the input noisy squeezed state. BS2 
models the photon subtraction at a beam splitter with reflectivity R. BS3 represents the loss in the photon 
subtraction mode given by the subtraction arm efficiency (ji). The two triangles represent ideal photon 
counters, one of which detects k photons in mode 3, and the other detects n - k photons in orthogonal modes 
(all of which we label as mode 6). BS4 models the known optical efficiency in the homodyne subtraction 
arm which includes the optical loss, photodiode efficiency, wavefront overlap and electrical 
background. 



Unlike the model in [6], we do not consider multimode interference effects - we treat 
photons in modes not matched to the local oscillator as if they are equivalent to dark 
counts in the subtraction detector. The APDs are unable to distinguish the n photon 
subtraction events from n + I or higher numbers, so in this experiment and our model we 
accept a heralding of an n photon subtraction event when the detector registers n or more 
photons. Although the TES detectors can distinguish n from n + I photons we use the 
same model to analyze the TES experiments. Because the frequency of the n + I events is 
so much smaller than the n photon events, their contribution is negligible. Figure A2 
shows a schematic for the model. Beginning with the pure squeezed state IS) (with 
squeezed variance Vq) in mode 1 and vacuum in modes 2, 3, 4, and 5 we calculate the 
state of this system as it evolves through the beam splitters, trace out the lost modes, and 
project mode 3 onto a k photon Fock state. We will account for the effect of dark counts 
in the subtraction detectors and photons in other modes entering the subtraction detector 
by including a second virtual photon counter, which registers n - k photons, so that the 
total number of photons detected is n. 



6 



Initially, this system is in the state 

A,.3,4,5=|^>J0),|0)3|0>J0>^X5|^{0|^(0|3(0L(0|^(0|. (A4) 

We apply beam splitters 1 through 4 to this system and then perform the partial trace over 
modes 2, 4, and 5, leaving the state of modes 1 and 3: 

A,3 = Tr2,4,5 [Bi,4 (7h )B3,5 (mAs " ^Al k )PiX3aA,2 iVs As " ^^,5 (mAa ^ )\ " 

(A5) 

Next we project mode 3 onto the k photon Fock state 



where P{k)=Tx[i{k\p\,i\k)z\ is the probability that pi,3 contains k photons in mode 3. 
Suppose mode 6 contains x photons with probability q{x). The probability that the two 
detectors together register n photons is 

Q{n)=Y^q{n-k)P{k), (A7) 
and the resulting state is 

^1 = 7^ S - ^)^(^)a {k) . (A8) 

We accept a heralding of an m photon subtracted state whenever m or more photons are 
detected, so we consider the statistical mixture of all <Ti(n) for n>m: 

TM = -^f^Q{n)cTM (A9) 
where the probability to detect at least m photons is 

m-l 

S{m) = ^Q{n) = l-^Q{n). (AlO) 

n=m n=0 



In fact Ti(m) does not depend on all of the details of the distribution q(x), and we simplify 
these expressions using modal coupling parameters <^„, for all n such that n is an integer, 
£ind 0<n<m. We define the ^„ as 
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{ m-\ 



S{m) 



1 - ^ q{s) for n = 0, 



(All) 



S{m) 



V s=0 



for 1 < n < m. 



Note that ^„ = 1 > and the modal coupling p£ir£imeters can be interpreted as 
probabilities. Using the modal coupling parameters we can rewrite Zi(m) as 



n=0 \k=n J 



n=0 



(A12) 



k=0 



where />id=Tr3(yCi3). We call the "modal purity" because it is the probability that the m 
or more observed photons were actually subtracted from the mode matched to the local 
oscillator. 

In practice we calculate these states and their associated probabilities by use of 
their Wigner functions, according to the methods described in [7]. Eventually we obtain 
W{q, p), the Wigner function for Ti(m), which depends on Vq, R, jU, the ^„'s, and m. 
We then perform a least squares fit of these Wigner functions to the Wigner functions 
reconstructed by maximum likelihood from the homodyne measurements to obtain 
estimates for the parameters Vo, and the ^„'s. From separate measurements we know R, 
jU and r/h, and fix these parameters in the fitting routine. A package to calculate the 
Wigner functions based on the above model can be obtained from the authors upon 
request. 



III. Experimental findings 

We have used the above model to fit our experimental data and obtain the 
experimental parameters. Figure A3 shows the Wigner functions obtained by maximum 
likelihood reconstruction from the homodyne data. We used a maximum number of 20 
photons for the reconstruction algorithm. Figure A3 also shows Wigner functions 
obtained by fitting our model to the maximum Ukelihood reconstructions. Table I lists our 
experimental findings. The beam splitter reflectivities were adjusted from 2.5 % to 20 % 
in the different photon-number-subtraction experiments. Higher fidelities are predicted 
for lower reflectivities, but we increased the reflectivity to increase the frequency of 
higher photon number subtraction events. For the fitting routine, the overall detection 
efficiency ju was set to 0.17 for the TES and to 0.08 for the APD measurements. Inferring 
other experimental parameters by fitting the model described above to the one photon 
subtraction measurements, we find = 0.72, Vo = 0.229 (-6.4 dB), Vq = 3.423 (+5.3 dB), 
Vp = 0.445 (-3.5 dB) and = 0.91. 




FIG. A3. I Wigner function plots of CSSs created in our experiments, (a)-(d) One, Two and Three photon 
subtraction maximum UkeHhood reconstruction and model-fit, respectively. 
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Table I Experimental findings. Vo> fjs and ^„ were obtained from a least squares fit of the above model, 
a, (m)(x and Fwere obtained by comparing the maximum likelihood state estimate with a theoretical CSS 
that gave highest fidelity. (n)p is the average photon number in the reconstructed state. "Wrmn is the 
minimum of the reconstructed Wigner function. The photon subtraction beam splitter reflectivity R was 
determined by a separate measurement. 



The model finds a better fit using a higher squeezing purity than is directly measured 
with homodyne detection. This may be attributed to using our single-mode model to 
describe our multi-mode states. A thorough investigation based on the multi-mode model 
in [6] may clarify this discrepancy. The fits to the data reveal a constant squeezing purity 
of about 7s = 0.72 for all one and two photon subtraction experiments. When we fit the 
model to the three photon subtraction data, we find that a squeezing purity of = 1 
provides the best fit. However, we know from other measurements that rjs < 0.75. The 
failure of the model in the three photon subtraction experiment may be caused by 
numerical difficulty obtaining the correct fit and/or multimode effects [6]. The modal 
purity of the subtracted photons is at least 0.84, except for the two photon TES 
experiment, where spurious LO photons scattered into the TES contribute to 38% false 
heralds, as determined from the modal purity. The scattered LO photons arrive 5 ns after 
the true signal photons; the delay is determined by the beam paths. Therefore, gating the 
APD with a window smaller than 5 ns suppresses the spurious LO contribution. This 
gating is possible only because of the APD's small jitter (=^400 ps). The TES (jitter ~ 100 
ns) does not allow for such accurate gating. Therefore the modal purity of the subtracted 
photons is lower in the TES case. Note that the lower modal purity could be improved. In 
our case the spurious photons originate from a reflection off one output port of the 
polarizing beam splitter that combines the CSS and the LO (PBS3 in figure Al). For 
example, we could use a slightly wedged output port surface which would ensure the 
reflection into a spatial mode that is orthogonal to the subtraction cirm's spatial mode. 
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The reason for the higher modal purity in the three photon subtraction experiment is that 
after increasing the subtracting beam splitter's reflectivity, the rate of subtracting three 
"good" photons was increased while the rate of detecting scattered LO photons was 
decreased. When reporting the fidelity of the states produced in our experiment, we 
maximize the fidelity over all ideal CSSs, obtaining the amplitude a of the highest 
fidelity CSS. The mean photon number <n)a of that CSS is calculated via 



where ^ = for an even CSS and ^ = 7t for an odd CSS. 
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